Low Complexity, Low Delay and Scalable Audio Coding Scheme Based on a Novel Statistical Perceptual Quantization Procedure
Authors
Abstract
In this paper we present Fast Perceptual Quantization (FPQ), a novel procedure for quantizing and coding audio signals. It employs the same psychoacoustic principles used in the popular MPEG/Audio coders, but substantially reduces the complexity and computational cost of the encoding process. FPQ defines a hierarchy of privileged quantization values and uses the masking threshold calculated by a psychoacoustic model to quantize the real values to the privileged ones whenever possible. The computational cost of this process is very low compared to the quantization/coding loops of MP3 or AAC. Experimental results show that nearly transparent coding can be achieved using as few as approximately 100 quantization values. This leads to very efficient bit compaction using Huffman or arithmetic coding, so that nearly state-of-the-art performance can be achieved in terms of the quality/bit-rate trade-off. Since the quantization and coding (bit compaction) procedures are completely independent here, efficient scalable decoding can be achieved either by parsing and entropy re-encoding the original quantized values or by coding the bit-planes independently and sorting them in order of perceptual significance. Very low delay is also achievable, which makes the proposed coding scheme suitable for real-time applications.
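The idea of snapping values to privileged quantization levels under a masking threshold can be illustrated with a minimal sketch. The abstract does not specify the actual hierarchy or the psychoacoustic model, so everything here is an assumption made for illustration: the dyadic step sizes in `STEPS`, the function names `nearest_on_grid`, `quantize_privileged` and `quantize_frame`, and the rule that the masking threshold acts as a maximum allowed absolute error per coefficient. It is not the authors' procedure.

```python
import math

# Hypothetical hierarchy (an assumption, not taken from the paper):
# coarse grids come first and are treated as the most "privileged".
STEPS = (8.0, 4.0, 2.0, 1.0)


def nearest_on_grid(x, step):
    """Round x to the nearest multiple of step (ties round upward)."""
    return step * math.floor(x / step + 0.5)


def quantize_privileged(x, mask_threshold, steps=STEPS):
    """Return the most privileged grid value whose error fits under the threshold.

    mask_threshold is assumed to be the largest absolute error that the
    psychoacoustic model considers inaudible for this coefficient.
    """
    for step in steps:
        q = nearest_on_grid(x, step)
        if abs(x - q) <= mask_threshold:
            return q
    # No grid satisfies the threshold: fall back to the finest grid.
    return nearest_on_grid(x, steps[-1])


def quantize_frame(coeffs, thresholds, steps=STEPS):
    """Quantize each coefficient with its own masking threshold."""
    return [quantize_privileged(x, t, steps) for x, t in zip(coeffs, thresholds)]
```

In this sketch each coefficient costs at most one rounding and one comparison per hierarchy level, with no iterative rate/distortion loop. A larger threshold pushes more coefficients onto the coarse grids, which shrinks the set of distinct output values that an entropy coder then has to represent.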
Similar resources
Wavelet packet filterbanks for low time delay audio coding
We study the application of wavelet packet filterbanks to low bit-rate transparent audio coding, taking the audio coders’ delay requirements into account, and propose low-delay coders based on wavelet packet filterbanks. We first develop a method of comparison between filterbanks for perceptual audio coding by estimating the necessary bit-rate for a transparent compression. We use this comparis...
Warped Linear Prediction for Improved Perceptual Quality in the SCELP Low Delay Audio Codec (W-SCELP)
The SCELP (Spherical Code Excited Linear Prediction) audio codec, which has recently been proposed for low delay audio coding [5], is based on linear prediction (LP). It applies closed-loop vector quantization employing a spherical code which is based on the Apple Peeling code construction rule. Frequency warped signal processing is known to be beneficial especially in the context of wideband a...
The D5 lattice quantization for 64 kbit/s low-delay subband audio coder with a 15 kHz bandwidth
A new method for coding generic audio signals at 64 kbit/s in the 20-15000 Hz bandwidth with low delay is presented. It combines subband coding, the Low Delay CELP algorithm and cascaded filterbanks. Our earlier work [1] showed that, when using an equal bit rate on each subband, the resulting audio quality was unsatisfactory. We propose a new technique based on lattice quantization to avoid the sea...
Improving perceptual coding of narrowband audio signals at low rates
This paper discusses perceptual coding of narrowband audio signals at low rates. In particular, it proposes a new error measure which shapes the noise inside the critical bands, a window switching criterion based on the temporal masking effect of the hearing system, a more accurate model of the simultaneous masking effect of the hearing system, perceptually-based bit allocation algorithms based...
A Nearly-transparent Low Delay Audio Coder
At present, there is no audio coding standard that provides nearly transparent quality with low delay. The ISO-MPEG standard is widely used for high-quality audio [1]. It uses a perceptual model that requires high frequency precision and introduces high delay. However, in applications where the delay is a critical parameter, e.g. when it is important to use a feedback channel, t...
Journal title:
Volume, issue:
Pages: -
Publication date: 2007